# Visual-Language Interaction
Qwen2 VL 7B Visual Rft Lisa IoU Reward
Apache-2.0
Qwen2-VL-7B-Instruct is a vision-language model based on the Qwen2 architecture. It accepts multimodal input of images and text and is suited to a range of visual-language tasks.
Image-to-Text
Safetensors English
Zery
726
4
Chat Vector Llava V1.5 7b Ja
A visual-language model that can hold dialogues in Japanese about input images, created with the Chat Vector method by combining weights from multiple models.
Image-to-Text
Transformers Japanese

toshi456
26
1
Featured Recommended AI Models